Exploring the Public Evidence on Open Access Monographs

Micah Altman, CREOS Research Scientist

[Draft: 12/1/2020]

Introduction

There is ongoing tension between the desire of scholars to share their work widely and openly, and the need to fund the infrastructure and labor of publishing. One place in which this tension is most evident is in the sale of scholarly monographs. While they are a only a small fraction of scholarly communications volume, market, and readership – academic monographs continue to play an important role in the humanities and social sciences. They represent an important form of long-form scholarship – not readily expressible through journal-length publications. And, as such, monograph publication through a university press forms a critical component of tenure evaluation – sometimes independent of the extent to which the monograph itself is purchased, read, or cited. (Eve 2014; Crossick 2016)

Die Mitschuldigen – One of the Oldest OA Monographs

Economic Pressures on Monograph Publishing

Monograph publication has been in crisis for approximately two decades. Changes in academic library collection policies — driven, in part, by the serials crisis and the greater integration of purchase-on-demand workflows – have led to traditional monograph publishing becoming generally unprofitable. (Crow, n.d.; Spence 2018) At the same time, there is an increasing demand among scholars, research funders, and the public that the outputs of scholarship be made open access. (Guédon 2019; Science Europe, n.d.)

There are many potential funding models for open monographs(Penier, Izabella, Eve, Martin Paul, and Grady, Tom 2020; Adema, Stone, and Keene, n.d.). Currently, a number of initiatives seek to promote consortial models involving both publishers and groups of libraries. These consortial models include library crowdfunding, membership fees, subscribe-to-open transition, and the direct funding of shared infrastructure. These models act to coordinate disciplinary communities (usually through libraries as representatives); enable publishers to streamline workflow for open digital publication; and reduce potential cost-risk (to publisher and reader).

These initiatives notwithstanding, open access monographs constitute a small fraction of the total monograph titles and in the near future, and will likely make up a few percent of monograph titles published annually. (Grimme et al. 2019)

Reviewing the Evidence

Open monograph publishing remains in its early stages. The CREOS “The Economics of Scholarly Monographs” project is an examination of this area. This fall, as an initial step, we published an annotated bibliography that serves as a jumping off point for scholars to explore the effects of open availability on monograph revenues.

In this blog post we look at the open data available on monograph publication, and use it to explore patterns and trends in open monograph publishing. This blog post takes the form of a guided, interactive, reproducible data analysis based on currently available public data.1 We aim for this exploration to inform libraries, publishers, and authors about the landscape, and prepare for future transitions to open access.

Accessible Data on Open Monographs

The most complete index of open access monographs is the Directory of Open Access Books (DOAB), which lists tens of thousands of individual monographs (also known as ‘titles’). DOAB makes its metadata index available as open data. In the table below you can browse or edit this.

Two other data sources are designed to provide additional information2 specifically about open access monograph titles:

In addition there are a number of publicly accessible (not necessarily open) sources of metadata about large collections books generally. The most notable comprise:

Explorations

Patterns

Publisher Revenues from APC’s

Pricing Changes

Connections

Citations Across the Open Landscape

Puzzles

The exploration above raises a number of questions – under what conditions does the open availability of the monograph impact prices and sales? What are mediating factors – does the length or subject of the monograph mediate sales effects? What are potential mechanisms at play?

This exploration is limited by existing data. Each individual press has information on the sales, costs, and usage of the monographs they publish. If pooled, this data could answer potentially answer deeper questions about the economics and utility academic monographs, and could guide a transition to open access models.

About this Document

This is a reproducible document. The source, along with necessary data is available here: https://github.com/MIT-Informatics/monograph . The code relies primarily on the R language, the Plot.ly graphics package, and open R libraries (especially tidyverse, i2dash, htmlwidgets, and crosstalk). All references in this document are managed in Zotero, The most straightforward way to examine and modify the source is to clone the module using git and then load the project using Rstudio.

The authors describe contributions to this Essay using a standard taxonomy (see [@allen2014]) Micah Altman provided the core formulation of the essay’s goals and aims, and led the writing, methodology, data curation, and visualization. Chris Bourg and Sue Kriegsman contributed to conceptualization and provided review. CREOS research assistant Shelley Choi provided assistance with preliminary data visualization and software implementation.

This work is Copyright 2020 Micah Altman, and is Licensed under the Apache License, Version 2.0 (the “License”); you may not use this file except in compliance with the License. You may obtain a copy of the License at http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an “AS IS” BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.

This work was conducted with support from the Center for Research and Equitable and Open Scholarship at the Massachusetts Institute of technology.

References

Adema, Janneke, Graham Stone, and Chris Keene. n.d. “Changing Publishing Ecologies: A Landscape Study of New University Presses and Academic-Led Publishing: A Report to JISC,” 103. http://repository.jisc.ac.uk/6666/1/Changing-publishing-ecologies-report.pdf.
Crossick, Geoffrey. 2016. “Monographs and Open Access.” Insights the UKSG Journal 29 (1): 14–19. https://doi.org/10.1629/uksg.280.
Crow, Raym. n.d. “A Rational System for Funding Scholarly Monographs: A White Paper Prepared for the AAU-ARL Task Force on Scholarly Communication,” 34. https://www.arl.org/wp-content/uploads/2012/11/aau-arl-white-paper-rational-system-for-funding-scholarly-monographs-2012.pdf.
Eve, Martin Paul. 2014. Open Access and the Humanities: Contexts, Controversies and the Future. Cambridge, United Kingdom: Cambridge University Press. https://www.cambridge.org/core/services/aop-cambridge-core/content/view/02BD7DB4A5172A864C432DBFD86E5FB4/9781107097896AR.pdf/Open_Access_and_the_Humanities.pdf?event-type=FTLA.
Grimme, Sara, Mike Taylor, Michael A. Elliott, Cathy Holland, Peter Potter, and Charles Watkinson. 2019. “The State of Open Monographs.” https://digitalscience.figshare.com/articles/The_State_of_Open_Monographs/8197625.
Guédon, Jean-Claude. 2019. Future of Scholarly Publishing and Scholarly Communication: Report of the Expert Group to the European Commission. https://doi.org/10.2777/836532.
Penier, Izabella, Eve, Martin Paul, and Grady, Tom. 2020. “COPIM Revenue Models for Open Access Monographs 2020.” https://doi.org/10.5281/ZENODO.4011836.
Science Europe. n.d. “Https://Www.ouvrirlascience.fr/Wp-Content/Uploads/2019/10/SE_on-Open-Access-to-Academic-Books_092019.pdf.” https://www.ouvrirlascience.fr/wp-content/uploads/2019/10/SE_On-Open-Access-to-Academic-Books_092019.pdf.
Spence, Paul. 2018. “The Academic Book and Its Digital Dilemmas.” Convergence: The International Journal of Research into New Media Technologies 24 (5): 458–76. https://doi.org/10.1177/1354856518772029.

  1. Since this blog takes the form of a fully replicable analysis, new versions will be released as the data sources it relies on are updated.↩︎

  2. Note that additional care is necessary when combining multiple data sources about monographs. Monographs are typically uniquely identified through an ISBN, which is also a common choice when linking across databases. However, each ISBN is associated with specific formats (e.g. paper, hardcover, digital), so a single work published in multiple formats will have multiple ISBN’s. Further, the same ISBN may be expressed in multiple forms – so normalization is essential (ISBNtools is useful for this). Finally some databases will use DOI (digital object identifiers) or ASIN (Amazon standard identification number). instead of ISBN. Generally the correspondence across identifiers must be resolved using an index – while is a partial mapping between ASIN’s and ISBN’s – ASIN’s for printed works generally match the ISBN number, but kindle editions (and related digital works) are assigned new ASIN’s.↩︎

  3. There are two other potential general usage sources outside of data collected by individual publishers and platforms: First, oaopen.org (in collaboration with IRUS-UK) collects counter-compliant title-level usage data for the monographs disseminated through its platforms, however this data is restricted to consortium members. Second, the Mellon-supported Data Trust for Open Access E-Book Usage aims to collect and aggregate title-level information for community use – however the project is in its early stages, and no data has yet been made available.↩︎